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^ Abstract 

^ We consider fault-tolerant quantum computation in the context where there are no fresh 

5 ancilla qubits available during the computation, and where the noise is due to a general quantum 

channel. We show that there are three classes of noisy channels: In the first, typified by the 
depolarizing channel, computation is only possible for a logarithmic time. In the second class, 
of which the dephasing channel is an example, computation is possible for polynomial time. The 
O ^ amplitude damping channel is an example of the third class, and for this class of channels, it is 

^ possible to compute for an exponential time in the number of qubits available. 

^ 1 Introduction 

The threshold theorem jAB97[ IKLZ98[ IKit97[ IA(5;P05j is the central result of the theory of fault- 
^ tolerant quantum computation. It states that, provided the error rate per gate or time step is 

ly^ below some constant threshold value, then arbitrarily long quantum computations are possible 

On with only polylogarithmic overhead. The threshold theorem tells us that large quantum computers 

are possible in principle, provided experimentalists can achieve an error rate below the threshold 

value. 

^ Naturally, then, it is of great interest to determine the precise physical assumptions which are 

CO needed for a threshold error rate to exist. Some assumptions made in the simplest versions of 

^ the threshold theorem can certainly be relaxed. For instance, a threshold exists when gates are 

^ restricted to nearest-neighbor interactions [AB08[ IGotOOj , or when the noise is due to very general 

^ weak interactions with a non-Markovian environment |AGP05i 'TB05]. Other assumptions are truly 

Vjj needed. For instance, it must be possible to perform gates in parallel, and the noise process must 

not cause many qubits to fail simultaneously in a correlated way. 

One assumption that has been traditionally classified as required is the need for fresh ancilla 
qubits in the course of the computation. In the course of quantum error correction, ancilla qubits 
are introduced and used to record the error syndrome. This can be viewed as a refrigeration process, 
where entropy which has been introduced into the data qubits by the noise gets pumped out into 
the ancilla qubits, cooling down the data qubits. In order for this to work, the ancilla qubits used 
must be cold themselves, or they cannot absorb the extra entropy from the data. Since ancilla 
qubits created at the beginning of the computation are themselves subject to the noise process, we 
can expect them to heat up over time, eventually making them worthless for refrigeration. Based 
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on this intuition, we expect to need a continual stream of new, freshly cooled ancilla qubits to keep 
the error correction running. 

At a rigorous mathematical level, this intuition is supported by the result of |ABIN96] . which 
showed that for noise in the form of a depolarizing channel, it is impossible to compute for longer 
than O(logn) time. We study more general channels and reach a different conclusion. In particular, 
we prove the following main theorem. The precise statements for what the upper and lower bounds 
mean in each case are deferred to the appropriate sections. 

Theorem 1.1 (Main Theorem). Let C be any non-unitary channel close to the identity, and 
consider quantum computations which suffer from noise C on each qubit at each time step. Up to 
unitary equivalence, the limit of repeatedly applying C can be either a point or a diameter in the 
Bloch sphere^ 

1. If the limit is the center of the Bloch sphere, it is possible to compute for 0(log(n)) time steps. 
This is tight up to log log n factors. An example of this class is the depolarizing channel. 

2. If the limit is a diameter, than it is possible to compute on 0{n^) qubits for 0{n^) time steps 
provided a-\-b < 1, and impossible even to store a single (unknown) qubit for more than 0{n^) 
time steps. An example of this class is the dephasing channel. 

3. If the limit is a point which is not the center of the Bloch sphere, it is possible to compute 
for an exponential number of time steps. An example of this class is the amplitude damping 
channel. 

For the third case, we expect that it is not possible to compute for longer than exponential 
time, but we have not managed to prove this. 

The three classes of channels can also be characterized in terms of how they treat the entropy 
of a state. The depolarizing class causes entropy to increase for all states until it reaches the 
maximum for the completely mixed state. For the dephasing class of channels, entropy is merely 
non-decreasing: For some states it increases, and for others it stays the same. For the amplitude 
damping class of channels, entropy can decrease under the channel. 

The result with the greatest practical significance is 3, as it enables quantum computation 
without fresh qubits. This doesn't really violate the intuition discussed above, it is just that for 
this class of channels, we can use the noise itself to cool the ancilla qubits. The rest of the paper 
is devoted to proving the main theorem. Here we give some intuition to the proofs of 1-3. 



Limit (C) is the center of the Bloch sphere: The proof is just a slight generalization 
of [ABIN96]. and uses the same techniques. 

Limit (C) is a diameter: The noise is equivalent to the phase damping channel. Computing 
for O(n^) time steps can be done by running the computation on O(n^) qubits, leaving 
n — 0{n^) qubits in the state |0) (or some other pure state which is unaffected by the noise). 
At each stage of the computation it is possible to take fresh qubits which were unaffected by 
the noise. Since at most qubits are needed every step, the computation can go for time 



(theorem 5.1) 



Proving the other direction is a little tricky, as it is not clear how fast the entropy rises. 
We will show that it is impossible to store half of an EPR pair for more than O(n^) time. 



^If C is a dephasing channel fonowed by a unitary rotation, the hmit point of C appUed repeatedly may be the 
center of the Bloch sphere, but we still consider it to be part of the 2nd class. Therefore, we choose the largest set 
of fixed points of a channel unitarily equivalent to C. See section |3] for more details. 
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To do this, we take advantage of the fact that the entropy cannot decrease. Therefore, if 
the state is stored for a long time, there must be some time step during which the entropy 
rises very httle. For the dephasing channel, this can only happen if the qubits are essentially 
unentangled and diagonal in the standard basis. In particular, further dephasing will have 
little effect on them. But a completely dephased state has no entanglement with the outside 
world at all, and therefore we have failed to maintain the EPR pair. 

• Limit (C) is a point which is not the center of the sphere: In this model we use 0{n) 
qubits instead of n qubits, and divide the system into three parts. 

The first part is just a standard compilation of the original circuit to a fault-tolerant compu- 
tation, just as if fresh ancillas were available. Denote by the number of qubits which the 
fault tolerant computation uses. We have that = O(^), and that at each step the fault 
tolerant circuit asks for at most fresh ancilla qubits. 

The second part of the computation is a storage house. This part holds n^T chunks of data. 
The size of each chunk is i?, and at each stage of the computation chunks are passed to 
the refrigerator. 

The last part is the refrigerator, which is actually built of O(n') refrigeration units which 
work in parallel. Each such unit takes a typical sequence of R qubits (where each qubit in 
the sequence is very similar to the limit of C), and condenses the entropy to R — 1 qubits 
and to a fresh qubit. Then the refrigerator passes qubits to the computation, which uses 
as many as it needs. The dirty qubits left over from the refrigerator, the qubits used up by 
the computation, and any unused fresh qubits are recycled by being sent back to the storage 
house. 

The constant R is chosen so that it will be possible to condense the entropy. T is chosen such 
that after T applications of C the qubit will be close enough to the limit of C. We take the 
noise to be weak enough so that not too many mistakes occur during the process in which 
the refrigerator condenses the entropy. 

1.1 Previous results 

We only state a few fundamental results in this field which are important for our constructions. 
The first and foremost important results are the threshold proofs. |AB97j and [AGP05] prove that 
quantum computation is possible under any noisy channel if the channel is weak enough (but still 
a constant), and if fresh qubits are introduced. |KLZ98| and |Kit97| also proved the threshold 
theorem, but only for stochastic noise. 

Theorem 1.2 (Threshold Theorem). There exists a constant threshold value pT > with the 
following property: Suppose a computation undergoes noise C at each location ( each gate and each 
time step), with ||C — /||o < pr- Then given any quantum circuit with depth t acting on n qubits, 
there exists a fault-tolerant simulation of the circuit whose output has statistical distance at most 
e away from that of the ideal circuit. The fault-tolerant simulation uses 0(npolylog(nt/e)) qubits 
and has depth 0{tpoljlog{nt/e)) . 

An assumption of the threshold theorem is that fresh ancilla qubits can be supplied at each 
step. If those ancilla qubits are not available, the threshold theorem no longer directly applies. 
II • II is the diamond norm [KSV02j, equivalent in this case to the completely bounded operator 
norm. 



3 



Our paper can be viewed as a generalization of jABIN96j . |ABIN96j showed that for the 
depolarizing channel, computation is impossible for more than 0(log(n)) steps. Razborov |RazQ6j 
used a similar argument to show that a very noisy channel is useless. These two results could 
create the false impression that quantum computation is impossible without the introduction of 
fresh qubits. To the best of our knowledge, phase damping, amplitude damping, or more complex 
kinds of noise did not receive a full treatment. 

2 The Model 

We will work in a model in which gates are perfect, but after each time step, each qubit undergoes a 
single-qubit channel C. Thus, we alternate layers consisting of gates with layers consisting of noise. 
Every qubit undergoes the same noise at each time step, and there is no memory or correlation 
between different qubits or different time steps. One could consider more complicated models in 
which qubits undergo different kinds of noise depending on the gates performed on them (i.e., 
allowing gate errors), or in which different qubits have different error models, but we believe this 
would simply add complication without fundamentally changing the results. 

The lower bounds (showing that computation is possible for a certain amount of time) are 
proven by allowing arbitrary single-qubit and two-qubit gates in parallel at each time step, as is 
usual for fault-tolerant constructions. The upper bounds (showing that computation is impossible 
for more than a certain amount of time) are more generous, and allow arbitrary many-qubit unitaries 
between the applications of noise at each time step. 

Because the only control available to us is via unitary gates, it is not possible to measure error 
syndromes and perform corrections based on classical processing of the outcome. However, (AB08j 
showed that a threshold still exists when all control is via unitary gates and measurement in the 
middle of the computation is not possible. Therefore, we will use this version of the threshold 
theorem throughout the paper. 

3 Classification of Qubit Channels 

Using the Bloch sphere representation of a qubit, any density matrix p can be written as 

p=l(/ + wa), (1) 

where w is a real vector with norm < 1 (i.e., inside the Bloch sphere) and a is the vector of 
Pauli matrices {X,Y,Z). In this representation, the action of any quantum channel C can be 
written pCEln] as C{p) = UC'{VpV^)U\ with 

C'(l[/ + wc7]) = ^[/+(t + rw).c7], (2) 

such that T is diagonal, t = iff the channel C is unital (meaning C{I) = /). That is, up to 
unitaries U and V ^ we can regard the channel as shifting the Bloch vector w and rescaling it in the 
X, y, and Z coordinate axes. In order for the channel to be positive, the output vector t + Tw 
must also have norm at most 1. 

In the future, we will ignore the unitaries U and V . This is because they are single-qubit 
unitaries and can be absorbed into the gate layers before and after the noise. Furthermore, if t = 
and T = /, C is actually unitary and there is no noise. 
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There are unital maps (the t = case) for which ah three eigenvalues of T are less than 1. In 
this case, all vectors w shrink towards the origin, and the only fixed point of the map is the center 
of the Bloch sphere, the maximally mixed state. This gives us the first class of maps, including the 
depolarizing channel 

C{p) = (1 - p)p + ^{XpX + YpY + ZpZ). (3) 

In fact, a map in this class is always a Pauli channel^ with probabilities pxi Py^ and pz of errors 
X, y, and Z: 

C{p) = {1 - px - py - Pz)p + PxXpX + prYpY + pzZpZ. (4) 
We can express px, PYi and pz in terms of the eigenvalues Ax, Ay, and \z of T, for instance 

^x = ^(l + Ax-Ay-Az). (5) 

By |KR01| . |Ay + A^l < |1 + Ax| for a completely positive map, so px > 0. Therefore any unital 
qubit channel corresponds to a sensible Pauli channel, up to unitary rotations. 

Next, we can consider unital channels for which T has one or two eigenvalues equal to 1. We 
may assume without loss of generality (again, up to unitary rotations) that the Z direction is 
one that is not contracted, so the Z eigenvalue Xz = 1. Then, by [KR01| . in order for C to be 
completely positive, \Xx — Ay| < |1 — A^| = 0. Therefore, only one eigenvalue can be 1 and the 
other two directions must contract by the same amount. This can be achieved with the dephasing 
channel 

C{p)^{l-p)p + pZpZ. (6) 

These channels leave the Z axis fixed and all other points contract towards it. This is the second 
class of channels. 

Finally, we have non-unital channels. This is the third class of channels. One such channel is 
the amplitude damping channel 

A qubit non-unital channel always has a unique fixed point: Since t ^ and the output vector 
t + Tw must have norm at most 1 for all w, it follows that T has all eigenvalues strictly less than 
1, so the map is strictly contractive. Therefore it has a unique fixed point, which we can identify 
as the vector 

tx ty tz 



Xx ' 1 — Ay ' 1 — A^ / ^ ^ 



4 Depolarizing Class 

In this section we prove that n qubits suffice for a circuit of depth 0(log(n)). 

For the lower bound, we invoke a result from jABIN96 ] : 
Theorem 4 from jABIN96j . // a boolean function f can be computed by a quantum circuit of 
size s and depth d, then f can be computed by a noisy quantum circuit of size 0(5 polylog(5)) • 
20(rfpoiyiog(d)) ^g^^/^ 0(dpolylog(d)). 

The theorem is stated only for depolarizing noise, but their proof works for any weak Pauli 
channel, which, as discussed in section [3) covers all unital channels. Basically, the argument is to 
continually purify ancillas by comparing them to each other with majority gates. The number of 
reliable ancillas available drops exponentially with time, which produces the exponential factor in 
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the size. The purified ancillas are used to run a regular fault-tolerant protocol. The consequence 
of this theorem is that using n qubits, it is possible to run any computation of O(logn) depth. 

To show that we can compute for a time at most O(logn), we can use essentially the same 
proof as for Theorem 3 of [ABI N96] . Let p = min{px^PY^Pz)' Let Dp be the depolarizing channel 
with probability p. Dp is no more noisy than the actual Pauli channel C, and by concavity of 
the entropy, the entropy increases at least as much under C as it does under Dp. By lemma 8 of 
[ABIN96J . the information I{p) = n — S{p) decreases by at least a factor 1 — p under Dp^ so it also 
decreases by at least a factor 1 — p under C. The rest of the proof is the same as in |ABIN96] . 

5 Dephasing channel 

In this section we consider a dephasing channel which acts on each qubit independently. We give 
polynomial upper and lower bounds for the ability to compute. 
Consider a dephasing channel p ^ {1 — p)p + ZpZ^ . 

Theorem 5.1. There exists a constant pT, such that if p < pT, then one can reliably perform any 
computation which involves 0{n^) qubits for 0{n^) time, provided a + b < 1. 

Proof. The standard techniques of fault tolerance can be used to compile a circuit into a fault 
tolerant circuit, provided the noise is below some threshold pt and that there is a supply of fresh 
qubits. If the original circuit acted on m qubits and runs for a time t, the fault tolerant one acts 
on m' — compolylog(mt) qubits and has depth t' — cit polylog(mt). 

Suppose m = 0{n^) and t — 0{n^). Given n physical qubits, put aside n — m' oi them, and 
run the fault-tolerant simulation of the circuit on the remaining m^ The qubits that have been 
set aside begin in the state |0), so are unaffected by the dephasing channel. At each time step, the 
fault-tolerant circuit calls for up to m' fresh ancillas. We can supply them out of the qubits set aside 
at the beginning. After a qubit is used as an ancilla, it is not used again. Therefore, we run out of 
fresh ancillas after a time (n — m')/m' — Q{n^~^ / polylog(mt)). This is sufficiently long to run the 
full computation if t' < {n — m')/m'^ in other words, if 0(n^ polylog(mt)) < Q{n^~^ / polylog(mt)). 
This win be true if a + 6 < 1. □ 

We also prove an impossibility result: 

Theorem 5.2. Begin with an EPR pair, with one qubit in a perfect reference system R and the 
other qubit stored in a noisy quantum computer, which undergoes a dephasing channel acting on 
each qubit with probability p at each time step. For any p > 0, after a time T — g^x^^je^; (^'^V 
quantum circuit applied to the noisy system will still lead to a state which has distance (in the 
2-norm) at most e from a separable state with the reference system. 

Proof. The proof is by contradiction. Suppose that there is a computation which could protect half 
an EPR pair against dephasing noise, such that the original qubit could be recovered (with high 
fidelity) after T steps. We track the evolution of the computation. 

At any time step, let Qi, . . . ,qn denote the qubits before the noise is applied, and let q[, . . . ^q^ 
denote the qubits after the noise. R is unaffected by the noise. We have that 

S{q[, ...,q',,R)^Yl S{qU+i,i+2, ■■■,CR) + S{R) > J] ...,qn,R) + S{R) (9) 
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because conditional entropy S{A\B) cannot decrease under a CP map on B (which can only lose 
information about A). Furthermore, S(q[\qi^i^ . . . ^qn^R) > S(qi\qi^i^ . . . ^qn^R) since a dephasing 
channel cannot decrease the entropy. This means 

Siq[, ...,CR)> Siqi, ...,qn,R) + [S{q[\q2, ...,qn,R)- S{qi\q2, ...,qn, R)]- (10) 

Naturally, this holds for any order of the qubits. 

^ ~ ^\\n2)n^ • every step of the computation, if there is at least one qubit qi for which 
S{ql\q-i) > S{qi\q-i) + 5, then the entropy of the whole state increases by at least 5 by (10). Here 
S{qi\q-i) means the conditional entropy of qubit qi conditioned on all other qubits in the system, 
including R. 

However, since the entropy is bounded by n, and the computation stores the encoded half of an 
EPR pair for T — n/5 time, there must be a time step t in which the entropy increase is at most 
5. At this time step, it must be that for every qubit qi 

S{(iM-i)-S{qi\q-i)<5 (11) 

We show that the decoding procedure will fail if applied after time t, by showing that at time 
t the state on the n qubits is close to diagonal, and therefore the entanglement to the reference 
system (the other half of the EPR pair encoded by the computation) is lost. 

At time t we have for every qubit qi 

S{q'M.i) - S{qi\q.i) = S{pp + (1 - p)ZipZi) - S{p), (12) 

where p is the joint state of all qubits in the computer and R. At this point, concavity of the 
entropy tells us that the RHS must be non-negative, but we wish to set a tighter bound in order 
to compare with the upper bound of 6. 

Pinsker's inequality (Theorem 8.6 in [ WatOSj ) gives a lower bound on the relative entropy: 

s{pm>^\\p-^\\l (13) 

Concavity of the entropy can be proved from sub-additivity, and sub-additivity follows from the 
positivity of the relative entropy, so Pinsker's inequality can give us a tighter version of concavity: 

s[(i - p)p + p^] - [(1 - p)s{p) + psm > ^^^^ Hp - eili- (14) 

Using this, we get that for any state ^, if ^[(l —p)p^pi] — [(1 —p)S{p) +pS'(^)] < 5, then 

Plugging in ^ = ZipZi gives 

\\]^{p + ZipZi)-p\\2 = \\\ZipZi-p\\2 < ^\/(ln2)(5/(2p(l-p)), (16) 

SO dephasing p on qubit i doesn't change it much. Since this holds for every z, we can apply the 
triangle inequality and get 

lk-p||2<n/2y(ln2)V(2p(l-p)), (17) 
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where a is the diagonal density matrix obtained by completely dephasing p on every qubit in the 
computer (but not R) . Plugging in 5 — ^^|^~^^^2^ gives that 

h-ph<e. (18) 

A perfect decoding map applied to the system at this point to try to extract the EPR pair cannot 
change the entanglement with the reference system R. Since the completely dephasing map is an 
entanglement-breaking channel, a decodes to a separable state with R. Therefore, p decodes to 
a state with 2-norm distance at most e from a separable state, which is necessarily far from the 
original EPR pair. 

□ 

6 Non unital channels 

Theorem 6.1. For any single-qubit density matrix P ^ 1/2, there exists a threshold value pt such 
that if C is any non-unital qubit channel with — /||o < pr and fixed point P, then it is possible to 
perform any quantum computation of width n and depth D using 0(npolylog(ni?)) physical qubits 
and depth 0{D Y)o\y\og{nD)) , where each physical qubit undergoes the noise C at each time step. 

Note that the constants in the depth and number of physical qubits will depend on the channel 
C, and that the threshold may depend on P. 

Proof. The basic idea is to perform a concatenated fault tolerance protocol, as usual for threshold 
proofs, but the fresh ancillas usually used for a threshold protocol will instead be provided by a 
separate part of the computer, set aside to act as a refrigerator. In particular, the qubits in the 
computer will be broken up into 3 components: 

1. The computation to be run, using error correcting codes and assuming a fresh supply of qubits. 
This is possible using standard fault tolerant techniques (see [A B971 iKitOTl IKLZ98[ [A(;P05j 1 . 
This computation assumes the error rate per step is below the threshold for fault tolerance, 
so Pt must be at least as small as the usual threshold. 

2. A storage house for ancillas. This is just a large (but linear in the size of the computation) 
array of chunks of qubits. Qubits are going to stay in the storage house for a long time. This 
means that they will undergo the noise C many times before being used again. 

3. A refrigerator. This component takes a chunk out of the storage house, and "cools down" a 
part of it, by encoding all the information in the beginning of the chunk, and leaving a few |0) 
qubits in the end. The cooling process does not violate the second rule of thermodynamics, 
as the noisy channel leaves each qubit with non-maximal entropy. Rather the refrigerator 
condenses the entropy using algorithmic cooling |SV99j . and supplies the computation (com- 
ponent 1) with fresh bits. 

At the start of the computation, the ancilla qubits are taken from a fresh supply. Once they 
have been used, ancilla qubits are then passed to the storage house where they will sit for a long 
time T. After a time T passes, groups of R qubits are removed from the storage house and placed 
into the refrigerator. The output of the refrigerator is 1 "reset" qubit and R-1 "waste" qubits. The 
reset qubits are then passed to the computation section to act once again as ancillas and the waste 
qubits are returned to the storage to wait again. For simplicity, let us assume the storage house 
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Figure 1: The computer consists of three components: A computation component, a storage house, 
and a refrigerator. 

begins fuh of qubits in the state P, so that the refrigerator can begin operating immediately, and 
that the initial supply of fresh ancilla qubits for the computation component only need last until 
the refrigerator finishes its first cycle. 

The computation component runs a standard concatenated fault-tolerant protocol. By the 
threshold theorem, if fresh ancillas are available, a protocol exists using n' — O(npolylog(ni?/eo)) 
physical qubits and depth — 0(i? polylog(ni?/eo) which achieves a final state with 1-norm 
distance at most eg from the correct final state of the computation. Here "fresh" ancilla qubits does 
not mean they are perfect; it is sufficient that they are prepared by a process that is independent 
from ancilla to ancilla such that each prepared state has 1-norm distance from |0)(0| which is less 
than the threshold value. In our case, we do not have fresh ancillas; the ancillas will instead be 
provided by the other components of the protocol, and may have errors which are very slightly 
correlated with each other. However, the computation component is designed just as if the ancillas 
were actually fresh. 

The time T is chosen so that any qubit placed in the storage house for a time T (during which 
time it experiences no gates other than the channel noise C) reaches a state very close to the 
fixed point P of C. Let Cp be the channel which throws away a qubit and replaces it with P\ 
Cp{p) — P. Then we choose T so that || - Cp ||o < ei/{n'D'R). Since C is strictly contractive, 

approaches Cp exponentially, so we may choose T — 0{\og{n' D' R)) — 0{\og{nDR)) to achieve 
a constant value of ei. 
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The refrigerator is a compression algorithm, chosen so that when run with an ideal circuit on 
R qubits in the state P, the output is 1 "reset" qubit in a state R and R — 1 "waste" qubits in 
any other state. R is chosen so that \\R — |0)(0|||i < 62. The state of the waste qubits is not 
important, but typically will be close to maximally mixed, indicating we have optimally purified a 
state. Also note that the waste qubits may be entangled with the reset qubit, but they cannot be 
very entangled, since R is close to a pure state. A refrigerator with these properties can be achieved 
with a constant value of R and a quantum circuit of constant size F. The size F is a count of the 
total number of locations in the refrigerator circuit, including wait steps. {R and F will in general 
depend on P, or at least on S{P).) 

Now, in the actual computation, the refrigerator experiences noise as well. However, at each 
step, the noise is close to the identity. A single run of the refrigerator algorithm with the real 
system therefore will therefore have distance at most FpT from the ideal refrigerator. Therefore, 
the output of one run of the refrigerator algorithm, if it is run with noise, but with all input qubits 
in exactly the state P, has distance at most 62 + FpT from the state |0)(0|. Let us choose 62 and 
Pt so that 62 + FpT is below the threshold for fault tolerance. Also note that at the start of the 
computation, ancilla qubits used before the first run of the refrigerator ends have experienced at 
most F locations as well, so their error rates are also below the threshold. 

Therefore, if the storage house is replaced by a black box which performs Cp on each qubit 
placed in it for a time T, the refrigerator outputs qubits which are sufficiently close to perfect to 
satisfy the requirements of the threshold theorem. Also, since the black box treats each qubit in 
the storage house separately, and the refrigerator algorithm is independent for each reset qubit, the 
outputs of the refrigerator will, in this version of the protocol, be independent. Therefore, when 
there is a black box storage house, the protocol satisfies the conditions of the threshold theorem, 
and the final state of the protocol has distance at most eo from the correct output of an ideal 
protocol without noise. 

However, the actual effect of the storage house is on each qubit, which is very close to Cp. 
Therefore, the true protocol will be very close to the one with a black box storage house. How 
close? To figure this out, we need to determine the total number M of qubits that get put through 
the storage house. Since \\C^ — Cp\\o < ei/{n'D'R) for a single qubit, the full protocol will be a 
distance at most Mei/{n'D'R) from the one with a black box storage house. When M is not too 
large, even though there may be residual correlations between the ancilla qubits exiting the storage 
house, the correlations are too weak to have a big effect on the protocol. 

At each time step, the computation component uses at most n' ancillas, each of which must be 
output by a refrigerator protocol which uses R qubits from the storage house. Therefore, we need 
at most n' R qubits to come out of the storage house at each time step, and there are at most D' 
time steps in the full fault-tolerant protocol, so M < n'RD' . The overall protocol thus has distance 
at most ei from the black box protocol, which in turn outputs a final state with distance at most 
eg from the ideal protocol. 

By choosing 60 = 61 = 6/2 for any constant 6, we therefore get a fault-tolerant protocol which 
outputs a state within distance e of the correct output state, as desired. At any time, the com- 
putation component uses n' qubits, the storage house contains at most TRn' qubits, and the 
refrigerator contains at most FRn^ qubits. Therefore the total number of qubits used is at most 
{1 + TR + FR)n' = 0{Tn') = 0(n polylog(nD)). The depth blow-up of the computation is the 
same as for the usual fault-tolerant protocol; that is, the depth is D' = 0(i?polylog(nD)). □ 

While this protocol works to simulate a circuit of any depth D, there is an overhead cost 
that depends polylogarithmically on D. Therefore, if D is superexponential, the number of qubits 
needed will be superpolynomial. Probably it is impossible to compute for more than exponential 
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Figure 2: Each block of the code has a separate storage house and refrigerator for a two-dimensional 
layout. 



time; for instance, for the amplitude damping channel, there is a probability at least p'^ at each 
time step that all qubits will relax to |0), leaving no remaining information about the input to the 
computation. However, we have not yet been able to prove this for a general non-unital channel. 

It is worth noting that theorem 6J_ still holds when quantum gates are restricted to nearest- 
neighbor interactions with a two-dimensional geometry, but we are not certain if it holds or not 
for a one-dimensional architecture. To implement the procedure in two dimensions, we lay out 
the "computation" component of the system along a line. It is possible to do this and still have 
a threshold error rate for fault tolerance |AB08t IGotOO] . The ancilla qubits needed for operations 
on each block of the quantum error-correcting code used in the protocol are prepared in a strip of 
constant width laid out orthogonal to the computation qubits, as in figure [2| Each strip contains 
a separate "storage" component and "refrigerator" component. In the context of a concatenated 
code, only the ancillas needed for operations at the lowest-level blocks need to be reset — higher- 
level ancilla qubits are built up from individual physical qubits, and when the physical qubits are 
nearly pure, the higher-level ancillas can be made from them in the usual way. 

Each strip is itself divided into two halves. In one half, qubits are moved away from the 
computation component by one place each time step. In the other half, qubits are moved one 
step closer to the computational component at each time step. At the end of the strip away from 
the computation component, qubits are shifted from the half of the strip moving away to the half 
moving towards the computational component. Thus, if there were no other operations going on. 
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qubits would travel to the end of the strip and then come back. The part of the strip moving away, 
and perhaps part of the half returning to the computational component, is devoted to the storage 
component. Qubits in it undergo no operations beyond moving along the strip. We make the strip 
sufficiently long so that qubits remain in the storage house for a long enough time T to be close to 
the fixed point P. 

The last part of the returning half of the strip implements the refrigerator component. The 
algorithmic cooling used in the refrigerator uses only a constant size circuit to prepare a constant 
number of ancillas (which is all that is needed for a single time step for a single block of the quantum 
error-correcting code), so a strip of constant width is adequate for this purpose. Once qubits in 
the strip have reached the computational component, they have been separated out into "reset" 
qubits, which are almost pure and can be used as ancillas in the computation, and "waste" qubits, 
which are returned to the storage house in the outgoing half of the strip. 

In one dimension, this strategy doesn't appear to work. We could attempt to intersperse level 
1 computational blocks with storage and fridge blocks, but as the error rate gets small, the storage 
house must get bigger and bigger since it takes longer for qubits to relax to the fixed point of the 
channel. This means the computational blocks get further and further apart, making it costly to 
do gates between them, which in turn means the error rate for logical gates is higher, offsetting the 
advantage of the fault-tolerant protocol. It is not clear if there is a cleverer approach which allows 
exponentially long computations in one dimension with non-unital noise. 

Note also that in this design, we are assuming that the original circuit to be simulated is laid 
out using nearest-neighbor gates in one dimension, yet we are implementing it in two dimensions. If 
the ideal circuit is instead given to us in a two- or higher-dimensional layout, we have two choices. 
Either we can use the approach above and add one dimension to the layout for the storage house 
and refrigerator components, or we can rewrite the ideal circuit into a one-dimensional one and 
then implement it in two dimensions. The latter choice potentially incurs an 0{n) blow-up in the 
depth of the circuit. It may be that there is an alternative approach to simulate two-dimensional 
circuits without fresh ancillas without adding a dimension and with only polylog overhead. 

7 Conclusion 

We have shown that the usual claim that a quantum computation needs fresh ancilla qubits supplied 
from the outside is not completely correct. Rather, we have shown that if the noise in the system is 
non-unital, then we can co-opt the noise to create fresh ancilla qubits, allowing us to recycle qubits 
and perform computation even in an otherwise closed system. We can do this because non-unital 
noise provides some cooling, taking a maximally mixed state to a less mixed state. For instance, 
if the system is in thermal contact with a cold bath, qubits left in the storage house will tend to 
equilibrate to the temperature of the bath. Viewed this way, of course the system is not truly 
closed, but it is open in only a single limited way. 

Moreover, our protocol works for any fixed point of the non-unital map. Therefore, even a 
very hot bath, provided it is at finite temperature, can provide enough cooling to leverage into a 
full-blown fault-tolerant computation. Of course, the overhead is very high when the fixed point P 
is close to maximally mixed, but it is perhaps surprising that it can still be done at all. One might 
instead have expected a threshold temperature above which the system is too hot to compute. 
There is a threshold error strength^ but that is a constraint on the coupling to the bath rather than 
the temperature of the bath. 

The dephasing class of channels is another interesting case, sitting on the borderline between the 
depolarizing class, for which long computations are impossible, and the amplitude damping class. 
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for which computation is possible. A pure dephasing channel, with no other sources of noise at all, is 
physically improbable, but in many physical systems, dephasing is the dominant error source. This 
suggests that in early quantum computers, it might be sufficient to keep a stock of ancilla qubits 
from the start and use them up over the course of the computation — the polynomial overhead 
we derive for this case — and only resort to resetting ancilla qubits either via external control or 
internal refrigeration once larger quantum computers are built. Indeed, this is the strategy already 
envisioned for many early quantum computers. The dephasing class is also subject to an interesting 
gap between the upper and lower bounds. Certainly it is possible to store a single EPR pair for 
0{n/ logn) time using a good QECC, but we have only been able to show that the upper bound on 
the storage time is O(n^). Can this upper bound be improved, or is there some storage procedure 
that is better than the obvious one? 

Finally, we have not considered channels for qudits of dimension greater than 2. We expect 
essentially the same classification to apply, based essentially on the "best" two-dimensional subspace 
for the channel. Channels which have a single fixed point at the maximally mixed state should 
still make computation impossible, whereas non-unital channels should still provide some cooling 
and thus make it possible to run exponentially long quantum computations. Unital channels which 
leave some pure states unchanged should act like the dephasing class, allowing polynomially-long 
quantum computations. However, qudit channels are more complicated than qubit channels, so we 
have not proven that these are all the possible behaviors. 
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